Towards Automatic Evaluation of Metadata Quality in Digital Repositories
نویسندگان
چکیده
Thanks to recent developments on automatic generation of metadata and interoperability between repositories, the production, management and consumption of metadata is vastly surpassing the human capacity to review or process this information. However, we need to assure that low quality metadata does not compromise the performance of the services that the repository provides to its users. We contend there is a need for automatic assessment of the quality of metadata in digital repositories, so tools or users can be alerted about low quality records. In this paper, we present several quality metrics for metadata based on quality evaluation frameworks used for human quality review. We applied these metrics to a sample of records from a real repository and compared the results with the quality assessment given to the same records by a group of human reviewers. Through correlation and regression analysis, we found that one of the metrics, the text information content, could be used as a predictor of the human evaluation. While these metrics are not proposed as a definitive measurement of the complete multi-dimensional quality of the metadata record, we present ways in which they can be used to enhance the functionality of digital repositories.
منابع مشابه
Towards Automatic Evaluation of Learning Object Metadata Quality
Thanks to recent developments on automatic generation of metadata and interoperability between repositories, the production, management and consumption of learning object metadata is vastly surpassing the human capacity to review or process these metadata. However, we need to make sure that the presence of some low quality metadata does not compromise the performance of services that rely on th...
متن کاملترسیم نقشه دانش حوزه کتابخانههای دیجیتالی در ایران: تحلیل همرخدادی واژگان
This study aimed to knowledge mapping of Digital Libraries (DLs) field in Iran. This is a scientometrics study. In this regard, Social Network and co-word analysis methods were used. 554 research resources such as books, national and international journal papers, conferences articles, and MA and Ph.D. Theses in Iran up to 2013 were studied. Researcher made checklist was used to collext data. Al...
متن کاملAutomatic Keyword Extraction for Learning Object Repositories
Introduction Learning object repositories are digital collections of educational materials, e.g., lectures, notes, presentations, which can be used to support learning. The main purpose of such repositories is to improve the sharing and reusability of the learning objects, which can be defined as “any digital resource that can be reused to support learning” (Wiley, 2000, p. 7). An important asp...
متن کاملAutomated Metadata in Multimedia Information Systems: Creation, Refinement, Use in Surrogates, and Evaluation
Improvements in network bandwidth along with dramatic drops in digital storage and processing costs have resulted in the explosive growth of multimedia (combinations of text, image, audio, and video) resources on the Internet and in digital repositories. A suite of computer technologies delivering speech, image, and natural language understanding can automatically derive descriptive metadata fo...
متن کاملارزیابی تطبیقی کارایی ساختار فراداده نظامهای شناسگر دیجیتالی
The main solution to the problems of persistency and uniqueness in identification of digital objects in a web environment is provided by using digital identifiers instead of URL. The main basis of this solution is resolution mechanism that is used in digital identifier systems. Resolution is the use of indirect names instead of URLs; what worked for the DNS (Domain Name System) in stabilizing i...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007